PyDigger - unearthing stuff about Python

Found 6 out of 307,797. Showing 6 on page 1. Total pages: 1.

Name	Version	Summary	date
docstrange	1.1.1	Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, JSON, CSV, HTML) with intelligent content extraction and advanced OCR.	2025-08-06 10:34:41
kreuzberg	3.10.1	Document intelligence framework for Python - Extract text, metadata, and structured data from diverse file formats	2025-07-31 11:54:20
document-data-extractor	1.0.4	Best open-source document to markdown extractor for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract	2025-07-29 08:25:56
llm-data-converter	2.2.0	Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract	2025-07-25 13:32:07
mseep-kreuzberg	3.8.2	Document intelligence framework for Python - Extract text, metadata, and structured data from diverse file formats	2025-07-17 03:32:28
extractable	1.0.2	Extract tables from PDFs	2024-05-31 09:37:17

Found 6 out of 307,797. Showing 6 on page 1. Total pages: 1.